(54) Method and apparatus for noise reduction, particularly in hearing aids



(57) This invention describes a practical application of noise reduction in hearing aids. Although listening in
noisy conditions is difficult for persons with normal hearing, hearing impaired individuals are at a considerable
further disadvantage. Under light noise conditions, conventional hearing aids amplify the input signal sufficiently
to overcome the hearing loss. For a typical sloping hearing loss where there is a loss in high frequency
hearing sensitivity, the amount of boost (or gain) rises with frequency. Most frequently, the loss in sensitivity is
only for low-level signals; high level signals are affected minimally or not at all. A compression hearing aid is able
to compensate by automatically lowering the gain as the input signal level rises. This compression action is
usually compromised under noisy conditions. In general, hearing aids are of lesser benefit under noisy conditions
since both noise and speech are boosted together when what is really required is a reduction of the noise relative
to the speech. A noise reduction algorithm with the dual purpose of enhancing speech relative to noise and also
providing a relatively clean signal for the compression circuitry is described.
Description

FIELD OF THE INVENTION

[0001] This invention relates to noise reduction in audio or other signals and more particularly relates to noise
reduction in digital hearing aids.

BACKGROUND OF THE INVENTION 

[0002] Under noisy conditions, hearing impaired persons are severely disadvantaged compared to those with normal
hearing. As a result of reduced cochlear processing, hearing impaired persons are typically much less able to distinguish
between meaningful speech and competing sound sources (i.e., noise). The increased attention necessary for
understanding of speech quickly leads to listener fatigue. Unfortunately, conventional hearing aids do little to aid this
problem since both speech and noise are boosted by the same amount.

[0003] Compression algorithms used in some hearing aids boost low level signals to a greater extent than high level
signals. This works well with low noise signals by raising low level speech cues to audibility. At high noise levels,
compression performs only modestly since the action of the compressor is unduly influenced by the noise and merely
boosts the noise floor. For persons that frequently work in high ambient sound environments, this can lead to
unacceptable results.

BRIEF SUMMARY OF THE INVENTION 

[0004] The present invention provides a two-fold approach to sound quality improvement under high noise situations
and its practical implementation in a hearing aid. The present invention removes noise from the input signal and controls
the compression stage with a cleaner signal. The signal for amplification (the upper path) is, optionally, processed with
a different noise reduction algorithm. Under certain circumstances, it may be desirable to use the same noise reduced
signal for application and compression control, in which case the two noise reduction blocks merge. In another instance,
it may be desirable to alter or eliminate the noise reduction in the upper path.

[0005] Clearly, noise reduction is not suitable for all listening situations. Any situation where a desired signal could
be confused with noise is problematic. Typically these situations involve non-speech signals such as music. A remote
control or hearing aid control will usually be provided for enabling or disabling noise reduction.

[0006] The present invention is based on the realization that what is required is a technique for boosting speech
or another desired sound source, while not boosting noise, or at least reducing the amount of boost given to noise.

[0007] In accordance with a first aspect of the present invention, there is provided a method of reducing noise in a
signal, the method comprising the steps:

(1) supplying the input signal to an amplification unit; 

(2) subjecting the input signal to an auxiliary noise reduction algorithm, to generate an auxiliary signal; 

(3) using the auxiliary signal to determine control inputs for the amplification unit; and 

(4) controlling the amplification unit with the control inputs, to generate an output signal with reduced noise.

[0008] Preferably, the input signal is subjected to a main noise reduction algorithm, to generate a modified input 
signal, which is supplied to the amplification unit. The main and auxiliary noise reduction algorithms can be different. 
[0009] In accordance with another aspect of the present invention, there is provided a method of reducing noise in
an input audio signal containing speech, the method comprising:

(1) detecting the presence and absence of speech utterances; 

(2) in the absence of speech, determining a noise magnitude spectral estimate; 

(3) in the presence of speech, comparing the magnitude spectrum of the audio signal to the noise magnitude
spectral estimate;

(4) calculating an attenuation function from the magnitude spectrum of the audio signal and the noise magnitude 
spectral estimate; and 

(5) modifying the input signal by the attenuation function, to generate an output signal with reduced noise. 

[0010] Preferably, the attenuation function is calculated in accordance with the following equation:

$$H(f) = \left[ \frac{|X(f)|^2 - \beta\,|\hat{N}(f)|^2}{|X(f)|^2} \right]^{\alpha}$$

where $H(f)$ is the attenuation function, $|X(f)|$ is the magnitude spectrum of the input audio signal, $|\hat{N}(f)|$ is the
noise magnitude spectral estimate, $\beta$ is an oversubtraction factor and $\alpha$ is an attenuation rule, wherein $\alpha$ and
$\beta$ are selected to give a desired attenuation function. The oversubtraction factor $\beta$ is, preferably, varied as a
function of the signal to noise ratio, with $\beta$ being zero for high and low signal to noise ratios and with $\beta$ being
increased as the signal to noise ratio increases above zero to a maximum value at a predetermined signal to noise ratio,
and for higher signal to noise ratios $\beta$ decreases to zero at a second predetermined signal to noise ratio greater
than the first predetermined signal to noise ratio.

[0011] Advantageously, the oversubtraction factor $\beta$ is divided by a preemphasis function $P(f)$ to give a modified
oversubtraction factor $\hat{\beta}(f)$, the preemphasis function being such as to reduce $\beta$ at high frequencies, to reduce
attenuation at high frequencies.

[0012] Preferably, the rate of change of the attenuation function is controlled to prevent abrupt and rapid changes in
the attenuation function, and it preferably is calculated in accordance with the following equation, where $G_n(f)$ is the
smoothed attenuation function at the nth time frame and $\gamma$ is a forgetting factor:

$$G_n(f) = (1 - \gamma)\,H(f) + \gamma\,G_{n-1}(f)$$


[0013] The oversubtraction factor $\beta$ can be a function of perceptual distortion.

[0014] The method can include remotely turning noise suppression on and off. The method can include automatically
disabling noise reduction in the presence of very light noise or extremely adverse environments.
[0015] Another aspect of the present invention provides for a method of determining the presence of speech in an
audio signal, the method comprising taking a block of an input audio signal and performing an auto-correlation on that
block to form a correlated signal; and checking the correlated signal for the presence of a periodic signal having a pitch
corresponding to that for speech.

[0016] In a further aspect the present invention provides an apparatus for reducing noise in a signal, the apparatus
including an input for a signal and an output for a noise reduced signal, the apparatus comprising: (a) an auxiliary noise
reduction means connected to the input for generating an auxiliary signal; and (b) an amplification means connected
to the input for receiving the original input signal and to the auxiliary noise reduction means, for receiving the auxiliary
signal, the amplification means being controlled by the auxiliary signal to generate an output signal with reduced noise.

BRIEF DESCRIPTION OF THE DRAWING FIGURES 

[0017] For a better understanding of the present invention and to show more clearly how it may be carried into effect,
reference will now be made, by way of example, to the accompanying drawings in which:

Figure 1 is a conceptual block diagram for hearing aid noise reduction;
Figure 2 shows a detailed block diagram for noise reduction in a hearing aid;
Figure 3 shows a modified auto-correlation scheme performed in segments.

DESCRIPTION OF THE PREFERRED EMBODIMENT 

[0018] Referring first to Figure 1, there is shown schematically a basic strategy employed by the present invention.
An input 10 for a noisy signal is split into two paths 12 and 14. In the upper path 12, the noise reduction is effected as
indicated in block 16. In the lower path 14, noise reduction is effected in unit 18. The noise reduction unit 18 provides
a cleaner signal that is supplied to compression circuitry 20, and the compression circuitry controls amplification unit
22 amplifying the signal in the upper path to generate an output signal at 24.

[0019] Here, the position of the noise reduction unit 18 provides a cleaner signal for controlling the compression
stage. The noise reduction unit 18 provides a first generating means which generates an auxiliary signal from an
auxiliary noise reduction algorithm. The auxiliary algorithm performed by unit 18 may be identical to the one performed
by unit 16, except with different parameters. Since the auxiliary noise reduced signal is not heard, unit 18 can reduce
noise more aggressively. This auxiliary signal, in turn, controls the compression circuitry 20, which comprises
second generating means for generating a control input for controlling the amplification unit 22.
[0020] The noise reduction unit 16 is optional, and can be effected by using a different noise reduction algorithm
from that in the noise reduction unit 18. If the same algorithm is used for both noise reduction processes 16 and 18,
then the two paths can be merged prior to being split up to go to units 20 and 22. As noted, the noise reduction in the
upper path may be altered or eliminated.
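
The two-path arrangement of Figure 1 can be sketched as follows. This is only an illustrative sketch under stated assumptions, not the implementation of the invention: the function names, the RMS level measure and the compression rule (threshold, ratio and maximum gain) are hypothetical. The text specifies only that the auxiliary noise reduced signal drives the compression circuitry 20, which in turn sets the gain of amplification unit 22.

```python
import math


def rms_db(frame):
    """Frame level in dB relative to full scale (1.0)."""
    power = sum(x * x for x in frame) / len(frame)
    return 10.0 * math.log10(max(power, 1e-12))


def compressor_gain_db(level_db, threshold_db=-40.0, ratio=2.0, max_gain_db=30.0):
    """Hypothetical compression rule (assumption, not from the text):
    full gain up to the threshold, then gain falls as the level rises."""
    if level_db <= threshold_db:
        return max_gain_db
    reduction = (level_db - threshold_db) * (1.0 - 1.0 / ratio)
    return max(0.0, max_gain_db - reduction)


def process_frame(frame, main_nr, aux_nr):
    """Upper path: main_nr(frame) is what gets amplified (units 16 and 22).
    Lower path: aux_nr(frame) is never heard; it only sets the gain
    (units 18 and 20)."""
    control = aux_nr(frame)
    gain = 10.0 ** (compressor_gain_db(rms_db(control)) / 20.0)
    return [gain * x for x in main_nr(frame)]
```

Passing the same callable for `main_nr` and `aux_nr` corresponds to the merged case described above; passing an identity function for `main_nr` corresponds to eliminating the noise reduction in the upper path.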

[0021] With reference to Figure 2, this shows a block diagram of a hearing aid with a specific realization of the
proposed noise reduction technique. The incoming signal at 10 is first blocked and windowed, as detailed in applicants'
simultaneously filed application serial no. which is incorporated herein by reference. The blocked and windowed output
provides the input to the frequency transform (all of these steps take place, as indicated, at 32), which preferably here
is a Discrete Fourier Transform (DFT), to provide a signal $X(f)$. The present invention is not however restricted to a
DFT and other transforms can be used. A known, fast way of implementing a DFT with mild restrictions on the transform
size is the Fast Fourier Transform (FFT). The input 10 is also connected to a speech detector 34 which works in parallel
to isolate the pauses in the incoming speech. For simplicity, reference is made here to "speech", but it will be understood
that this encompasses any desired audio signal, including music. These pauses provide opportunities to update the
noise spectral estimate. This estimate is updated only during speech pauses as a running slow average. When speech
is detected, the noise estimate is frozen.
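
The update rule just described (slow running average during pauses, frozen during speech) might look like the sketch below. The function name and the averaging rate are assumptions made for illustration; the text does not give a value for the averaging constant.

```python
def update_noise_estimate(noise_mag, frame_mag, speech_detected, rate=0.05):
    """Return the new noise magnitude spectral estimate, one value per bin.

    rate is a hypothetical averaging constant; a small value gives the
    slow running average described in the text.
    """
    if speech_detected:
        # Speech present: freeze the estimate.
        return list(noise_mag)
    # Speech pause: move each bin slightly toward the current frame.
    return [(1.0 - rate) * n + rate * x for n, x in zip(noise_mag, frame_mag)]
```

The frame magnitudes would come from the transform at 32 and the `speech_detected` flag from the speech detector 34.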

[0022] As indicated at 38, the outputs from both the unit 32 and the voice detection unit 34 are connected to block
38 which detects the magnitude spectrum of the incoming noise, $|\hat{N}(f)|$. The magnitude spectrum detected by unit 38
is an estimate. The output of unit 32 is also connected to block 36 for detecting the magnitude spectrum of the incoming
noisy signal, $|X(f)|$.

[0023] A noise filter calculation 40 is made based on $|X(f)|$ and $|\hat{N}(f)|$, to calculate an attenuation function $H(f)$. As
indicated at 42, this is used to control the original input signal $X(f)$. This signal is subject to an inverse transform and
overlap-add resynthesis in known manner, to give an output at 44.

[0024] During speech utterances, the magnitude spectrum is compared with the noise spectral estimate. In general,
frequency dependent attenuation is calculated as a function of the two input spectra. Frequency regions where the
incoming signal is higher than the noise are attenuated less than regions where the incoming signal is comparable to or
less than the noise. The attenuation function is generally given by

$$H(f) = \left[ \frac{|S(f)|^2}{|S(f)|^2 + |N(f)|^2} \right]^{\alpha}$$

where

$H(f)$ is the attenuation as a function of frequency
$S(f)$ is the clean speech spectrum
$N(f)$ is the noise spectrum
$\alpha$ is the attenuation rule

The attenuation rule preferably selected is the Wiener attenuation rule which corresponds to $\alpha$ equal to 1. The Wiener
rule minimizes the noise power relative to the speech. Other attenuation rules can also be used, for example the
spectral subtraction rule having $\alpha$ equal to 0.5.

[0025] Since neither $S(f)$ nor $N(f)$ is precisely known, and both would require a priori knowledge of the clean speech
and noise spectra, they are replaced by estimates $\hat{S}(f)$ and $\hat{N}(f)$:

$$|\hat{S}(f)|^2 = |X(f)|^2 - |\hat{N}(f)|^2$$

where $X(f)$ is the incoming speech spectrum and $\hat{N}(f)$ is the noise spectrum as estimated during speech pauses. Given
perfect estimates of the speech and noise spectra, application of this formula yields the optimum (largest)
signal-to-noise ratio (SNR). Although the SNR would be maximized using this formula, the noise in the resulting speech is still
judged as excessive by subjective assessment. An improved implementation of the formula taking into account these
perceptual aspects is given by:

$$H(f) = \left[ \frac{|X(f)|^2 - \beta\,|\hat{N}(f)|^2}{|X(f)|^2} \right]^{\alpha}$$

where:

$\beta$ is an oversubtraction factor
$\alpha$ is the attenuation rule

$H(f)$ should be between 0.0 and 1.0 to be meaningful. When negative results are obtained, $H(f)$ is simply set to
zero at that frequency. In addition, it is beneficial to increase the minimum value of $H(f)$ somewhat above zero to avoid
complete suppression of the noise. While counterintuitive, this reduces the musical noise artifact (discussed later) to
some extent. The parameter $\alpha$ governs the attenuation rule for increasing noise levels. Generally, the higher $\alpha$ is set,
the more the noise is punished as $X(f)$ drops. It was found that the best perceptual results were obtained with $\alpha$ = 1.0.
The special case of $\alpha$ = 1.0 and $\beta$ = 1.0 corresponds to power spectrum subtraction yielding the Wiener filter solution
as described above.
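
As a sketch of how the attenuation function could be evaluated per frequency bin, including the clamping to the range 0.0 to 1.0 and the raised minimum discussed above, consider the following. The function name and the floor value of 0.1 are assumptions; the text says only that the minimum should be "somewhat above zero".

```python
def attenuation(x_mag, n_mag, beta=1.0, alpha=1.0, floor=0.1):
    """H(f) = [(|X|^2 - beta*|N|^2) / |X|^2] ** alpha, one gain per bin.

    x_mag: magnitude spectrum of the noisy input.
    n_mag: noise magnitude spectral estimate.
    floor: assumed minimum gain (hypothetical value).
    """
    gains = []
    for x, n in zip(x_mag, n_mag):
        if x <= 0.0:
            # No input energy in this bin: fall back to the minimum gain.
            gains.append(floor)
            continue
        ratio = (x * x - beta * n * n) / (x * x)
        h = max(ratio, 0.0) ** alpha  # negative results are set to zero
        gains.append(min(1.0, max(h, floor)))
    return gains
```

For example, a bin with |X| = 2 and an estimated |N| = 1 gives (4 - 1)/4 = 0.75 under the Wiener rule ($\alpha$ = 1, $\beta$ = 1), while a bin where the input equals the noise estimate is held at the floor rather than being removed completely.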

[0026] The parameter $\beta$ controls the amount of additional noise suppression required; it is ideally a function of the
input noise level. Empirically it was noticed that under very light noise (SNR > 40 dB) $\beta$ should be zero. For lower SNR
signals, the noise reduction becomes less reliable and is gradually turned off. An example of this additional noise
reduction is:

$$\beta = \begin{cases}
0 & \text{for } \mathrm{SNR} < 0 \\
\beta_0 \, \dfrac{\mathrm{SNR}}{5} & \text{for } 0 < \mathrm{SNR} < 5 \\
\beta_0 \left( 1 - \dfrac{\mathrm{SNR} - 5}{35} \right) & \text{for } 5 < \mathrm{SNR} < 40 \\
0 & \text{for } \mathrm{SNR} > 40
\end{cases}$$

In this example, $\beta_0$ refers to the maximum attenuation, 5.0. In effect, from SNR = 0, the attenuation $\beta$ is ramped up
uniformly to a maximum, $\beta_0$, at SNR = 5, and this is then uniformly ramped down to zero at SNR = 40.
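
The ramp described in this example can be written as a small function. This is a sketch: the function and parameter names are hypothetical, while the breakpoints (0, 5 and 40 dB) and the maximum of 5.0 are taken from the example in the text.

```python
def oversubtraction(snr_db, beta_max=5.0, snr_peak=5.0, snr_off=40.0):
    """Oversubtraction factor as a function of SNR in dB.

    Zero at or below 0 dB, linear rise to beta_max at snr_peak,
    linear fall back to zero at snr_off, zero above that.
    """
    if snr_db <= 0.0 or snr_db >= snr_off:
        return 0.0
    if snr_db <= snr_peak:
        return beta_max * snr_db / snr_peak
    return beta_max * (1.0 - (snr_db - snr_peak) / (snr_off - snr_peak))
```

Halfway up the rising ramp (2.5 dB) and halfway down the falling ramp (22.5 dB) both give half of the maximum, 2.5.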
[0027] Another aspect of the present invention provides improvements in perceptual quality by making $\beta$ a function of
frequency. As an instance of the use of this feature, it was found that to avoid excessive attenuation of high frequency
information, it was necessary to apply a preemphasis function, $P(f)$, to the input spectrum $X(f)$, where $P(f)$ is an
increasing function of frequency. The effect of this preemphasis function is to artificially raise the input spectrum above
the noise floor at high frequencies. The attenuation rule will then leave the higher frequencies relatively intact. This
preemphasis is conveniently accomplished by reducing $\beta$ at high frequencies by the preemphasis factor:

$$\hat{\beta}(f) = \frac{\beta}{P(f)}$$

where $\hat{\beta}(f)$ is $\beta$ after preemphasis.
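
A minimal sketch of the frequency dependent oversubtraction factor follows. The text requires only that $P(f)$ increase with frequency; the linear form used here, and the names `tilt` and `f_max_hz`, are assumptions chosen purely for illustration.

```python
def preemphasized_beta(beta, freqs_hz, f_max_hz=8000.0, tilt=1.0):
    """Divide beta by an increasing preemphasis function P(f).

    P(f) = 1 + tilt * f / f_max_hz is a hypothetical choice; any
    increasing function of frequency fits the description.
    """
    return [beta / (1.0 + tilt * f / f_max_hz) for f in freqs_hz]
```

With these assumed defaults, the oversubtraction factor is unchanged at 0 Hz and halved at `f_max_hz`, so high frequency bins are attenuated less.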

[0028] Without further modification, the above formula can yield noise reduced speech with an audible artifact known
as musical noise. This occurs because, in order for the noise reduction to be effective in reducing noise, the frequency
attenuation function has to be adaptive. The very act of adapting this filter allows isolated frequency regions of low
SNR to flicker in and out of audibility, leading to this musical noise artifact. Various methods are used to reduce this
problem. Slowing down the adaptation rate significantly reduces this problem. In this method, a forgetting factor, $\gamma$, is
introduced to slow abrupt gain changes in the attenuation function:

$$G_n(f) = (1 - \gamma)\,H(f) + \gamma\,G_{n-1}(f)$$

where $G_n(f)$ and $G_{n-1}(f)$ are the smoothed attenuation functions at the nth and (n-1)th time frames.
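
The smoothing recursion can be sketched per frequency bin as below. The function name and the default forgetting factor of 0.9 are assumptions; the text does not state a value.

```python
def smooth_gain(prev_gain, target_gain, gamma=0.9):
    """G_n(f) = (1 - gamma) * H(f) + gamma * G_{n-1}(f), bin by bin.

    prev_gain:   smoothed attenuation from the previous frame, G_{n-1}(f).
    target_gain: attenuation computed for the current frame, H(f).
    gamma:       forgetting factor (assumed value); closer to 1 is slower.
    """
    return [(1.0 - gamma) * h + gamma * g for h, g in zip(target_gain, prev_gain)]
```

Calling this once per frame makes the applied gain approach the target gradually, which limits the frame-to-frame flicker that produces musical noise.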
[0029] Further improvements in perceptual quality are possible by making $\beta$ (in addition to being a function of
frequency) a function of perceptual distortion. In this method, the smoothing function (instead of a simple exponential or
forgetting factor as above) bases its decision on adapting $G_n(f)$ on whether such a change is masked perceptually. The
perceptual adaptation algorithm uses the ideal attenuation function $H(f)$ as a target because it represents the best SNR
attainable. The algorithm decides how much $G_n(f)$ can be adjusted while minimizing the perceptual distortion. The
decision is based on a number of masking criteria in the output spectrum including:

1. Spread of masking - changes in higher frequency energy are masked by the presence of energy in frequencies
in the vicinity - especially lower frequencies;

2. Previous energy - changes in louder frequency components are more audible than changes in weaker frequency
components;

3. Threshold of hearing - there is no point in reducing the noise significantly below the threshold of hearing at a
particular frequency;

4. Previous attenuation - low levels should not be allowed to jump up rapidly - high levels should not suddenly drop
rapidly unless masked by 1), 2) or 3).

[0030] For applications where the noise reduction is used to preprocess the input signal before reaching the
compression circuitry (schematically shown in Figure 1), the perceptual characteristics of the noise reduced signal are less
important. In fact, it may prove advantageous to perform the noise reduction with two different suppression algorithms
as mentioned above. The noise reduction 16 would be optimized for perceptual quality while the other noise reduction
18 would be optimized for good compression performance.

[0031] A key element to the success of the present noise suppression or reduction system is the speech or voicing
detector. It is crucial to obtain accurate estimates of the noise spectrum. If the noise spectral estimate is updated during
periods of speech activity, the noise spectrum will be contaminated with speech, resulting in speech cancellation. Speech
detection is very difficult, especially under heavy noise situations. Although a three-way distinction between voiced
speech, unvoiced speech (consonants) and noise is possible under light noise conditions, it was found that the only
reliable distinction available in heavy noise was between voiced speech and noise. Given the slow averaging of the
noise spectrum, the addition of low-energy consonants is insignificant.

[0032] Thus, another aspect of the present invention uses an auto-correlation function to detect speech, as the
advantage of this function is the relative ease with which a periodic signal is detected. As will be appreciated by those
skilled in the art, an inherent property of the auto-correlation function of a periodic signal is that it shows a peak at the
time lag corresponding to the repetition period (see Rabiner, L.R., and Schafer, R.W., Digital Processing of Speech
Signals, (Prentice Hall Inc., 1978) which is incorporated herein by reference). Since voiced speech is nearly periodic
in time at the rate of its pitch period, a voicing detector based on the auto-correlation function was developed. Given
a sufficiently long auto-correlation, the uncorrelated noise tends to cancel out as successive pitch periods are averaged
together.

[0033] A strict short-time auto-correlation requires that the signal first be blocked to limit the time extent (samples
outside the block are set to zero). This operation is followed by an auto-correlation on the block. The disadvantage of
this approach is that the auto-correlation function includes fewer samples as the time lag increases. Since the pitch
lag (typically between 40 and 240 samples, equivalent to 2.5 to 15 milliseconds) is a significant portion of the
auto-correlation frame (typically 512 samples or 32 milliseconds), a modified version of the auto-correlation function avoiding
this problem was calculated. This modified version of the auto-correlation function is described in Rabiner, L.R., and
Schafer, R.W., Digital Processing of Speech Signals, supra. In this method, the signal is blocked and correlated with
a delayed block (of the same length) of the signal. Since the samples in the delayed block include samples not present
in the first block, this function is not a strict auto-correlation but shows periodicities better.
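
A direct time-domain sketch of the modified auto-correlation and a simple peak test is given below. The lag range of 40 to 240 samples and the 512-sample block come from the text; the function names and the peak threshold of 0.5 relative to the zero-lag value are assumptions, and the actual voicing decision in the invention uses further inputs.

```python
def modified_autocorrelation(signal, start, block_len, max_lag):
    """Correlate signal[start:start+block_len] with delayed blocks.

    Every lag sums block_len products, unlike a strict short-time
    auto-correlation, because the delayed block reaches past the end
    of the first block.
    Requires len(signal) >= start + block_len + max_lag.
    """
    return [
        sum(signal[start + n] * signal[start + n + lag] for n in range(block_len))
        for lag in range(max_lag + 1)
    ]


def find_pitch_lag(corr, min_lag=40, max_lag=240, threshold=0.5):
    """Return the lag of the largest value in the pitch range, or None.

    threshold is an assumed parameter: the peak must reach this
    fraction of the zero-lag value to count as periodic.
    """
    if corr[0] <= 0.0:
        return None
    best = max(range(min_lag, max_lag + 1), key=lambda lag: corr[lag])
    return best if corr[best] >= threshold * corr[0] else None
```

For a pulse train repeating every 160 samples, `find_pitch_lag` applied to the 512-sample modified auto-correlation returns 160; for an all-zero correlation it returns `None`.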

[0034] It is realized that a hearing aid is a real-time system and that all computational elements for each speech
block are to be completed before the next arrives. The calculation time of a long auto-correlation, which is required
only every few speech blocks, would certainly bring the system to a halt every time it must be calculated. It is therefore
recognized that the auto-correlation should be segmented into a number of shorter sections which can be calculated
for each block and stored in a partial correlation table. The complete auto-correlation is determined by stacking these
partial correlations on top of each other and adding as shown in Figure 3.

[0035] Referring to Figure 3, input sample 50 is divided into separate blocks stored in memory buffers as indicated
at 52. The correlation buffers 52 are connected to a block correlation unit 54, where the auto-correlation is performed.
Partial cross-correlations 56 are summed to give the final correlation 58.
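
The segmentation can be illustrated with the sketch below, which checks on a short test signal that summing the partial correlations of shorter sections reproduces the long modified auto-correlation exactly. The function names and the toy sizes are assumptions made for illustration.

```python
def partial_correlation(signal, start, section_len, max_lag):
    """Modified auto-correlation restricted to one short section."""
    return [
        sum(signal[start + n] * signal[start + n + lag] for n in range(section_len))
        for lag in range(max_lag + 1)
    ]


def stack_partials(partials):
    """Add the stored partial correlations lag by lag (the stacking of Figure 3)."""
    return [sum(column) for column in zip(*partials)]


# Toy example: one 8-sample block split into two 4-sample sections.
signal = [float((7 * n) % 5 - 2) for n in range(16)]
full = partial_correlation(signal, 0, 8, 3)
parts = [partial_correlation(signal, s, 4, 3) for s in (0, 4)]
same = stack_partials(parts) == full
```

Each partial correlation costs only a fraction of the full computation, so the work can be spread over successive blocks and the stored partials added once all of them are available.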

[0036] This technique quickly yields the exact modified auto-correlation and is the preferred embodiment when
sufficient memory is available to store the partial correlations.

[0037] When memory space considerations rule out the above technique, a form of exponential averaging may be
used to reduce the number of correlation buffers to a single buffer. In this technique, successive partial correlations
are summed to the scaled down previous contents of the correlation buffer. This simplification significantly reduces the
memory but implicitly applies an exponential window to the input sequence. The windowing action, unfortunately,
reduces time periodicities. The effect is to spread the autocorrelation peak to a number of adjacent time lags in either
direction. This peak smearing reduces the accuracy of the voicing detection somewhat.
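
The single-buffer variant can be sketched as follows. The function name and the scaling constant `decay` are assumptions; the text states only that the previous buffer contents are scaled down before the new partial correlation is added.

```python
def update_correlation_buffer(buffer, partial, decay=0.8):
    """Add a new partial correlation to the scaled-down previous buffer.

    decay is a hypothetical scaling constant between 0 and 1. Older
    sections are weighted by successive powers of decay, which is the
    implicit exponential window mentioned in the text.
    """
    return [decay * old + new for old, new in zip(buffer, partial)]
```

Only one buffer of `max_lag + 1` values is kept, at the cost of the peak smearing described above.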

[0038] In the implementations using an FFT transform block, these partial correlations (for either technique given
above) can be performed quickly in the frequency domain. For each block, the correlation operation is reduced to a
sequence of complex multiplications on the transformed time sequences. The resulting frequency domain sequences
can be added directly together and transformed back to the time domain to provide the complete long auto-correlation.
In an alternate embodiment, the frequency domain correlation results are never inverted back to the time domain. In
this realization, the pitch frequency is determined directly in the frequency domain.
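
The frequency domain route can be sketched as below. To keep the example self-contained, a naive DFT stands in for the FFT; the function names, the zero-padding choice and the toy sizes are assumptions. The per-section cross-spectra are accumulated and a single inverse transform yields the long correlation.

```python
import cmath


def dft(x):
    """Naive DFT (a stand-in for an FFT in this sketch)."""
    m = len(x)
    return [sum(x[n] * cmath.exp(-2j * cmath.pi * k * n / m) for n in range(m))
            for k in range(m)]


def idft(spec):
    """Naive inverse DFT."""
    m = len(spec)
    return [sum(spec[k] * cmath.exp(2j * cmath.pi * k * n / m) for k in range(m)) / m
            for n in range(m)]


def cross_spectrum(section, delayed, size):
    """conj(DFT(section)) * DFT(delayed), both zero-padded to size."""
    a = dft(list(section) + [0.0] * (size - len(section)))
    b = dft(list(delayed) + [0.0] * (size - len(delayed)))
    return [x.conjugate() * y for x, y in zip(a, b)]


def long_correlation(signal, section_len, num_sections, max_lag):
    """Sum per-section cross-spectra, then invert once.

    Requires len(signal) >= num_sections * section_len + max_lag.
    """
    size = section_len + max_lag  # long enough to avoid circular wrap-around
    total = [0j] * size
    for s in range(num_sections):
        start = s * section_len
        section = signal[start:start + section_len]
        delayed = signal[start:start + section_len + max_lag]
        spec = cross_spectrum(section, delayed, size)
        total = [t + v for t, v in zip(total, spec)]
    return [v.real for v in idft(total)[:max_lag + 1]]
```

On a short test signal, the result agrees with the direct time-domain sum of products to within rounding error, because correlation is linear and the accumulation can therefore be done before the inverse transform.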

[0039] Since the auto-correlation frame is long compared to the (shorter) speech frame, the voicing detection is
delayed compared to the current frame. Compensation for this delay is accomplished in the noise spectrum update
block.

[0040] An inter-frame constraint was placed on frames considered as potential candidates for speech pauses to
further reduce false detection of noise frames. The spectral distance between the proposed frame and the previous
estimates of the noise spectrum is computed. Large values reduce the likelihood that the frame is truly a pause. The
voicing detector takes this information, the presence or absence of an auto-correlation peak, the frame energy, and a
running average of the noise as inputs.
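
One way the listed inputs might be combined is sketched below. This is speculative: the text names the inputs but gives neither the distance measure nor the decision logic, so the RMS log-spectral distance, both thresholds and the function names are assumptions made only to illustrate the idea.

```python
import math


def log_spectral_distance(frame_mag, noise_mag, eps=1e-12):
    """RMS difference in dB between two magnitude spectra (assumed measure)."""
    diffs = [20.0 * math.log10((x + eps) / (n + eps))
             for x, n in zip(frame_mag, noise_mag)]
    return math.sqrt(sum(d * d for d in diffs) / len(diffs))


def is_pause(frame_mag, noise_mag, has_pitch_peak, frame_energy, noise_energy,
             max_distance_db=6.0, max_energy_ratio=2.0):
    """Hypothetical pause decision using the inputs listed in the text."""
    if has_pitch_peak:
        return False  # periodic content suggests voiced speech
    if frame_energy > max_energy_ratio * noise_energy:
        return False  # much louder than the running noise average
    # Inter-frame constraint: the candidate must resemble the noise estimate.
    return log_spectral_distance(frame_mag, noise_mag) <= max_distance_db
```

A frame accepted by such a test would be used to update the noise spectral estimate; a rejected frame would leave it frozen.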

[0041] Further aspects of the invention are set out below.

[0042] In a first aspect the invention may include a method of reducing noise in a signal, the method comprising the
steps of: 

(1) supplying the input signal to an amplification unit;

(2) subjecting the input signal to an auxiliary noise reduction algorithm, to generate an auxiliary signal;

(3) using the auxiliary signal to determine a control input for the amplification unit; and 

(4) controlling the amplification unit with the control signal, to generate an output signal with reduced noise. 

[0043] Preferably the invention includes a method wherein the input signal is subjected to a main noise reduction
algorithm, to generate a modified input signal, which is supplied to the amplification unit.

[0044] Preferably the invention includes a method wherein the main and auxiliary noise reduction algorithms are 
different. 

[0045] In a further aspect the invention may include a method of reducing noise in an input, audio signal containing 
speech, the method comprising the steps of: 

(1) detecting the presence and absence of speech utterances; 

(2) in the absence of speech, determining a noise magnitude spectral estimate; 

(3) in the presence of speech comparing the magnitude spectrum of the audio signal to the noise magnitude 
spectral estimate; 

(4) calculating an attenuation function from the magnitude spectrum of the audio signal and the noise magnitude
spectral estimate; and

(5) modifying the input signal by the attenuation function, to generate an output signal with reduced noise. 

[0046] Preferably the invention includes a method wherein the square of the speech magnitude spectral estimate is
determined by subtracting the square of the noise magnitude spectral estimate from the square of the magnitude
spectrum of the input signal.

[0047] Preferably the invention includes a method wherein the attenuation function is calculated in accordance with
the following equation:

$$H(f) = \left[ \frac{|X(f)|^2 - \beta\,|\hat{N}(f)|^2}{|X(f)|^2} \right]^{\alpha}$$

where $H(f)$ is the attenuation function, $|X(f)|$ is the magnitude spectrum of the input audio signal, $|\hat{N}(f)|$ is the noise
magnitude spectral estimate, $\beta$ is an oversubtraction factor and $\alpha$ is an attenuation rule, wherein $\alpha$ and $\beta$ are selected
to give a desired attenuation function.

[0048] Preferably the invention includes a method wherein the oversubtraction factor $\beta$ is varied as a function of the
signal to noise ratio, with $\beta$ being zero for high and low signal to noise ratios and with $\beta$ being increased as the signal
to noise ratio increases above zero to a maximum value at a predetermined signal to noise ratio, and for higher signal
to noise ratios $\beta$ decreases to zero at a second predetermined signal to noise ratio greater than the first predetermined
signal to noise ratio.

[0049] Preferably the invention includes a method wherein the oversubtraction factor $\beta$ is divided by a preemphasis
function $P(f)$ to give a modified oversubtraction factor $\hat{\beta}(f)$, the preemphasis function being such as to reduce $\beta$ at high
frequencies, and thereby reduce attenuation at high frequencies.

[0050] Preferably the invention includes a method wherein the rate of change of the attenuation function is controlled
to prevent abrupt and rapid changes in the attenuation function.

[0051] Preferably the invention includes a method wherein the attenuation function is calculated at successive time
frames, and the attenuation function is calculated in accordance with the following equation:

$$G_n(f) = (1 - \gamma)\,H(f) + \gamma\,G_{n-1}(f)$$

wherein $G_n(f)$ and $G_{n-1}(f)$ are the smoothed attenuation functions at the nth and (n-1)th time frames, and $\gamma$ is a
forgetting factor.

[0052] Preferably the invention includes a method wherein $\beta$ is a function of perceptual distortion.

[0053] Preferably the invention includes a method which includes remotely turning noise suppression on and off.

[0054] Preferably the invention includes a method which includes automatically disabling noise reduction in the
presence of very light noise or extremely adverse environments.

[0055] Preferably the invention includes a method which includes detecting speech with a modified auto-correlation 
function. 

[0056] Preferably the invention includes a method wherein the auto-correlation function comprises:

(1) taking an input sample and separating it into short blocks and storing the blocks in correlation buffers; 

(2) correlating the blocks with one another, to form partial correlations; and 

(3) summing the partial correlations to obtain a final correlation. 

[0057] Preferably the invention includes a method wherein the method is carried out by digital signal processing and 
wherein the method includes using a Fast Fourier Transform to generate the partial correlations and includes detection 
of voiced speech directly in the frequency domain. 

[0058] In a further aspect the invention may include a method of determining the presence of speech in an audio
signal, the method comprising taking a block of an input audio signal and performing an auto-correlation on that block
to form a correlated signal; and checking the correlated signal for the presence of a periodic signal having a pitch
corresponding to that for speech.

[0059] Preferably the invention includes a method wherein the auto-correlation is performed on a first block taken
from an audio signal, and a delayed block from the audio signal.

[0060] Preferably the invention includes a method wherein each block is subdivided into a plurality of shorter sections
and the correlation comprises a correlation between pairs of the shorter sections to form partial correlations, and
subsequently summing the partial correlations to obtain the correlated signal.

[0061] Preferably the invention includes a method wherein an input signal is stored as a plurality of samples in a pair
of correlation buffers, and the auto-correlation is performed on the signals in the buffers to determine the partial
correlations, which partial correlations are summed and stored.

[0062] In a further aspect the invention may include an apparatus for reducing noise in a signal, the apparatus
including an input for a signal and an output for a noise reduced signal, the apparatus comprising:

(a) an auxiliary noise reduction means connected to the input for generating an auxiliary signal; and

(b) an amplification means connected to the input for receiving the original input signal and to the auxiliary noise
reduction means, for receiving the auxiliary signal, the amplification means being controlled by the auxiliary signal
to generate an output signal with reduced noise.

[0063] Preferably the invention includes an apparatus wherein the auxiliary noise reduction means comprises: 

(1) detection means connected to said input and providing a detection signal indicative of the presence of a desired 
audio signal; 

(2) magnitude means for determining the magnitude spectrum of the input signal, with both the detection means 
and the magnitude means being connected to the input; 

(3) spectral estimate means for generating a noise magnitude spectral estimate and being connected to the de- 
tection means and to the input of the apparatus; and 

(4) noise filter calculation means connected to the spectral estimate means and the magnitude means, for receiving 
the noise magnitude spectral estimate and magnitude spectrum of the input signal to produce the auxiliary signal 
and having an output for the auxiliary signal connected to the amplification means. 

[0064] Preferably the invention includes an apparatus which includes a frequency transform means connected 
between said input and both of the magnitude means and the spectral estimate means for transforming the signal into 
the frequency domain to provide a transformed signal, wherein the magnitude means determines the magnitude 
spectrum from the transformed signal, and wherein the spectral estimate means determines the noise spectral estimate 
from the transformed signal. 
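
The magnitude means and the spectral estimate means of paragraphs [0063] and [0064] could be sketched roughly as below. The direct DFT, the recursive averaging and the `smoothing` constant of 0.9 are assumptions of this sketch. The only behaviour taken from the text is that the noise estimate is tied to the detection signal, so the sketch updates it only while no speech is detected.

```python
import cmath
import math


def magnitude_spectrum(block):
    """Magnitude of the DFT of a real block, for bins 0 .. N/2."""
    n = len(block)
    return [
        abs(sum(block[t] * cmath.exp(-2j * cmath.pi * k * t / n)
                for t in range(n)))
        for k in range(n // 2 + 1)
    ]


def update_noise_estimate(noise_mag, input_mag, speech_present, smoothing=0.9):
    """Return the new noise magnitude spectral estimate.

    The estimate is held while the detector reports speech and is
    otherwise moved toward the current input magnitude spectrum.
    """
    if speech_present:
        return list(noise_mag)
    return [
        smoothing * nm + (1.0 - smoothing) * xm
        for nm, xm in zip(noise_mag, input_mag)
    ]


# Usage: one cosine cycle over 8 samples concentrates in bin 1.
frame = [math.cos(2 * math.pi * t / 8) for t in range(8)]
spectrum = magnitude_spectrum(frame)
noise = update_noise_estimate([0.0] * len(spectrum), spectrum, speech_present=False)
```

A practical implementation would use an FFT rather than this O(N²) transform; the direct form is used here only to keep the sketch free of dependencies.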

[0065] Preferably the invention includes an apparatus wherein the noise filter calculation means determines the 
square of the speech magnitude spectral estimate by subtracting the square of the noise magnitude spectral estimate 
from the square of the magnitude spectrum of the input signal and wherein the noise filter calculation means calculates 
the auxiliary signal as an attenuation function in accordance with the following equation: 



$$H(f) = \left[ \frac{|X(f)|^{2} - \beta\,|\hat{N}(f)|^{2}}{|X(f)|^{2}} \right]^{\alpha}$$

where H(f) is the attenuation function, |X(f)| is the magnitude spectrum of the input audio signal, |N̂(f)| is the noise 
magnitude spectral estimate, β is an oversubtraction factor and α is an attenuation rule, wherein α and β are selected 
to give a desired attenuation function. 
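
A minimal sketch of the attenuation function of paragraph [0065] follows, assuming the form H(f) = [(|X(f)|² − β|N̂(f)|²) / |X(f)|²]^α. The function name, the default values of `beta` and `alpha`, and the clamping of negative ratios through `ratio_floor` are assumptions of this sketch; the text does not say how a negative difference is handled.

```python
def attenuation(input_mag, noise_mag, beta=1.0, alpha=0.5, ratio_floor=0.0):
    """Per-bin attenuation H(f) from input and noise magnitude spectra.

    beta is the oversubtraction factor and alpha the attenuation rule.
    ratio_floor (an assumption) clamps the ratio when the scaled noise
    power exceeds the input power, so the exponent stays well defined.
    """
    gains = []
    for x, n in zip(input_mag, noise_mag):
        if x <= 0.0:
            # No input energy in this bin: use the floor value.
            gains.append(ratio_floor ** alpha)
            continue
        ratio = (x * x - beta * n * n) / (x * x)
        gains.append(max(ratio, ratio_floor) ** alpha)
    return gains


# Usage: three bins with high, negative and infinite signal to noise ratio.
gains = attenuation([2.0, 1.0, 3.0], [1.0, 2.0, 0.0])
```

With these defaults a bin whose noise estimate is zero passes unchanged (gain 1), while a bin dominated by noise is fully attenuated. Larger `beta` removes more noise at the cost of more speech distortion.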



Claims 



1. An apparatus, for reducing noise in an input signal (10), the apparatus including an input for receiving the input 
signal (10), the apparatus comprising: 

(a) a compression circuit (20) for receiving a compression control signal and generating an amplification control 
signal in response; 

(b) an amplification unit (22) for receiving the input signal (10) and the amplification control signal and gener- 
ating an output signal (24) with compression and reduced noise; and, 

(c) an auxiliary noise reduction unit (18) connected to the input for generating an auxiliary noise reduced signal 
(14), the compression control signal being the auxiliary noise reduced signal. 

2. An apparatus as claimed in claim 1, wherein the apparatus further comprises a main noise reduction unit (16) 
connected to the input for generating a noise reduced signal (12) and supplying the noise reduced signal (12) to 
the amplification unit (22) in place of the input signal (10). 

3. An apparatus as claimed in claim 2, wherein the input signal (10) contains speech and the main noise reduction 
unit (16) comprises: 

(1) a detector (34) connected to said input and providing a detection signal indicative of the presence of speech; 

(2) magnitude means (36) for determining the magnitude spectrum of the input signal (|X(f)|), with both the 
detector (34) and the magnitude means (36) being connected to the input of the apparatus; 

(3) spectral estimate means (38) for generating a noise magnitude spectral estimate (|N̂(f)|) 
and being connected to the detector (34) and to the input of the apparatus; 

(4) a noise filter calculation unit (40) connected to the spectral estimate means (38) and the magnitude means 
(36), for receiving the noise magnitude spectral estimate (|N̂(f)|) 
and magnitude spectrum of the input signal (|X(f)|) and calculating an attenuation function (H(f)); and, 

(5) a multiplication unit (42) coupled to the noise filter calculation unit (40) and the input signal (10) for producing 
the noise reduced signal. 

4. An apparatus as claimed in claim 2, wherein the main noise reduction unit (16) and the auxiliary noise reduction 
unit (18) comprise a single unit. 

5. An apparatus as claimed in claim 2, wherein the auxiliary noise reduction unit (18) is different from the main noise 
reduction unit (16). 

6. An apparatus as claimed in claim 3, wherein the input signal (10) has a signal to noise ratio and the noise filter 
calculation unit (40) produces the noise reduced signal (12) in dependence upon the signal to noise ratio, wherein 
there is no substantial modification to the input signal (10) for very low and for very high signal to noise ratios. 

7. An apparatus as claimed in claim 3, which includes a frequency transform means (32) connected between said 
input and both of the magnitude means (36) and the spectral estimate means (38) for transforming the signal into 
the frequency domain to provide a transformed signal (X(f)) wherein the magnitude means (36) determines the 
magnitude spectrum (|X(f)|) from the transformed signal (X(f)), and wherein the spectral estimate means (38) 
determines the noise spectral estimate (|N̂(f)|) 
from the transformed signal (X(f)) in the absence of speech, the apparatus further including inverse frequency 
transform means (44) for receiving a transformed noise reduced signal from the multiplication unit (42), the inverse 
frequency transform means (44) providing the noise reduced signal (12). 

8. An apparatus as claimed in claim 7, wherein the noise filter calculation unit (40) determines the square of the 
speech magnitude spectral estimate by subtracting the square of the noise magnitude spectral estimate from the 
square of the magnitude spectrum of the input signal (10) and wherein the noise filter calculation unit (40) calculates 
the attenuation function (H(f)), as a function of frequency, in accordance with the following equation: 

$$H(f) = \left[ \frac{|X(f)|^{2} - \beta\,|\hat{N}(f)|^{2}}{|X(f)|^{2}} \right]^{\alpha}$$

where f denotes frequency, H(f) is the attenuation function, |X(f)| is the magnitude spectrum of the input audio 
signal, |N̂(f)| is the noise magnitude spectral estimate, β is an oversubtraction factor and α is an attenuation rule, 
wherein α and β are selected to give a desired attenuation function. 